How to Group Crashes Effectively: Comparing Manually and Automatically Grouped Crash Dumps
Abstract
Crash dumps have become an important source from which software developers learn about quality issues in released software. Since the same bug can be triggered repeatedly by different users, an overwhelming number of crash dumps are returned daily. Techniques for automatically grouping crash dumps are mostly based on call stacks captured at crash sites; although fast, they can incorrectly group irrelevant crash dumps or miss related ones. The goal of this paper is to compare manually and automatically grouped crash dumps and to discover grouping criteria that are used effectively in manual diagnosis but are lacking in automatic tools, so that more precise and capable crash diagnostic tools can be built. In our study, we compared a total of 1,550 groups and 30,431 crash dumps from 5 Mozilla applications. We find that 1) call stacks are more dissimilar from each other in manual groups because, besides matching call stacks, developers frequently link multiple sources, such as reproduction steps and revision histories, when grouping crashes; 2) while automatic tools focus on grouping crashes that share the same root cause, developers also correlate crash dumps across different versions of a program, or even across different applications, for related root causes; 3) both automatic and manual approaches are imprecise, but the two make different types of mistakes; and 4) for small applications, developers' domain knowledge of the code works more effectively than the automatic approach for correlating crashes. From the study, we learn that to group crashes more effectively, future tools should 1) enable multiple criteria and explore diverse uses of crash dump groups for prioritizing and fixing bugs, 2) correlate multiple sources of information or even multiple applications, and 3) uncover inherent relations between symptoms and source code.
Keywords: Crash Dumps, Call Stacks, Grouping, Similarity
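To make the idea of call-stack-based grouping concrete, the following is a minimal sketch and not the method used by the paper or by any Mozilla tool. It assumes a crash dump can be reduced to a list of function names ordered from the crash site outward; the function names, the `top_n` cutoff, and the sample dumps are all hypothetical. It groups dumps by their top frames and uses Jaccard distance as one possible measure of how dissimilar two call stacks are.

```python
from collections import defaultdict


def signature(stack, top_n=3):
    """Hypothetical grouping key: the top_n frames nearest the crash site."""
    return tuple(stack[:top_n])


def group_by_signature(dumps, top_n=3):
    """Group dump ids whose call stacks share the same top_n frames."""
    groups = defaultdict(list)
    for dump_id, stack in dumps.items():
        groups[signature(stack, top_n)].append(dump_id)
    return dict(groups)


def jaccard_distance(stack_a, stack_b):
    """Dissimilarity of two stacks as 1 - |A ∩ B| / |A ∪ B| over frame names."""
    a, b = set(stack_a), set(stack_b)
    if not a and not b:
        return 0.0
    return 1.0 - len(a & b) / len(a | b)


# Made-up dumps; frame names are illustrative only.
dumps = {
    "d1": ["free", "release_buffer", "close_tab", "main"],
    "d2": ["free", "release_buffer", "close_tab", "event_loop", "main"],
    "d3": ["free", "release_buffer", "reload_page", "main"],
    "d4": ["memcpy", "decode_image", "paint", "main"],
}

groups = group_by_signature(dumps)
for key, members in groups.items():
    print(key, members)

# d1 and d3 land in different groups although their stacks overlap heavily.
print(jaccard_distance(dumps["d1"], dumps["d3"]))
```

In this toy data, `d1` and `d2` fall into one group while `d3` is separated from them even though it shares the two frames nearest the crash site. That is the kind of missed relation the abstract attributes to purely stack-based grouping, and it is where additional sources such as reproduction steps or revision histories could help.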